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The  Cold  Fish  War:  long-term  competition  in  a  dynamic  game 
I.  Introduction 

In  games  where  players  move  several  times,  it  is  reasonable  that  their 
choices  at  any  given  moment  should  depend  on  information  about  the 
previous  history  of  play.  This  dependence  has  been  exploited  to 
characterize  the  equilibrium  behavior  of  infinitely-repeated  games 
[supergames].  This  paper  applies  the  methods  of  analysis  developed  for  the 
supergame  to  an  example  of  a  more  general  class  of  games  called  dynamic 
games . 

The  "Great  Fish  War"  model  constructed  by  Levhari  and  Mirman  [1980]  pro¬ 
vides  an  interesting  and  tractable  example  of  a  dynamic  game.  It  concerns 
the  joint  exploitation  of  a  renewable  resource  by  several  players.  In 
their  paper,  Levhari  and  Mirman  apply  a  Cournot-Nash  solution  concept  that 
satisfies  an  additional  "backwards  induction"  rationality  constraint.  In 
order  to  do  this,  they  first  consider  a  situation  in  which  there  is  but  a 
single  future  period.  Assuming  equal  division  in  this  last  period,  they 
construct  the  payoff  functions  for  the  penultimate  period,  and  solve  for 
the  Nash  equilibrium  levels  of  e.xtraction  as  a  function  of  the  initial 
stock.  Repeating  the  process,  they  solve  for  equilibrium  behavior  in  each 
previous  period.  The  sequence  of  strategies  converges  to  a  limit  as  the 
induction  proceeds,  and  this  limit  determines  a  steady-state  equilibrium. 

The  essence  of  the  recursion  had  been  brought  out  by  Mirman  [1979],  wiiich 
formulated  the  decision  problem  faced  by  each  player.  The  extraction  at 
any  given  instant  was  chosen  to  maximize  the  value  of  current  consumption 
plus  the  expected  present  discounted  value  of  the  remaining  stock. 

Denoting  the  extraction  by  two  players  as  c(l)  and  c(2),  the  value  of 
current  consumption  by  player  i  as  U(i,c(i)),  the  discount  factor 
characterising  player  i  as  6.,  and  the  renewal  function  by  f, 

where  f(x)  is  the  stock  next  period  if  x  was  the  stock  at  the  end  of  the 
present  period,  the  value  to  player  i  of  a  stock  S,  V(i,S),  satisfies: 

(1)  V(i,S)  =  max  U(i,c(i))  +  6^V( i , f (S-c( 1 ) -c(2) ) ) 


This  is  essentially  the  same  as  the  backwards  induction  solution:  both 
depend  on  previous  history  only  through  the  current  stock,  and  if  backwards 
induction  converges,  it  will  converge  to  a  solution  to  (1).  Therefore, 
both  approaches  yield  an  equilibrium  in  "memoryless"  strategies. 

Those  papers  left  open  two  outstanding  questions,  which  have  implications 
beyond  the  specific  example.  The  first  is  that  a  single  externality  links 
the  two  players;  that  created  by  the  renewal  process.  If  the  resource  were 
to  be  sold  in  a  noncompetitive  market,  or  used  in  the  production  of 
commodities  sold  on  such  a  market,  additional  externalities  would  be 
introduced  [Levhari  and  Mirman,  op.  cit.  note  3J.  These  have  been  analyzed 
by  Keinganum  and  Stokey  [1981]. 

In  addition,  with  cither  tlie  "heuristic"  procedure  or  backwards  induction 
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current  consumption  depends  only  on  current  stock,  and  is  optimal  at  every 
stock  level.  Other  strategies  might  be  considered  which  contain  explicit 
or  implicit  threats  [Levhari  and  Mirman,  op.  cit.  page  323].  However,  as 
in  the  repeated-game  case,  simple  forms  of  dependence  can  add  a  great  deal 
to  our  understanding  of  equilibrium  behavior.  Indeed,  a  recent  paper  by 
Lewis  and  Cowens  [1983]  exploits  such  strategies  to  determine  the 
conditions  under  which  the  symmetric  cooperative  duopoly  outcome  can  be 
supported  by  a  perfect  equilibrium. 

The  strategies  we  shall  consider  are  obtained  under  relaxed  optimality 
conditions,  and  involve  more  explicit  dependence  on  the  past,  albeit  of  a 
particularly  simple  kind.  Typically,  strategies  will  consist  of  a  sequence 
of  extractions  to  be  followed  i f  no  player  deviates  from  prescribed 
behavior,  together  with  a  "punishment"  strategy  to  be  employed  if  there 
have  been  defections.  Weak  global  optimality,  which  requires  that  each 
player's  strategy  be  a  best  response  to  the  other  player's,  defines  Nash 
equilibrium.  Local  optimality,  which  requires  that  every  player's  choices 
in  every  conceiveable  contingency  be  weakly  globally  optimal,  defines 
subgame  perfect  equilibrium. 

This  paper  extends  the  analysis  of  "cooperative  equilibria"  of  the  Levhari- 
Mirman  example  by  using  methods  developed  for  repeated  games  to  study 
equilibria  which  Parcto-dominate  the  Cournot-Nash  equilibrium  of  Levhari- 
Mirman.  The  term  "Cold  War"  reflects  the  fact  that  such  "cooperative" 
behavior  is  supported  by  threats  which  are  not  carried  out.  The  first 
section  summarizes  the  repeated  game  tools  and  the  results  obtained  with 
them.  The  next  section  contains  the  Levhari-Mirman  example,  and 
characterizes  the  "cooperative"  equilibrium  outcomes.  The  third  section 
describes  a  sufficient  condition  for  subgame  perfection  of  an  equilibrium 
outcome  and  demonstrates  that  backwards  induction  optimality  is  not  implied 
by  perfection,  and  that  there  are  many  desireable  perfect  equilibria  that 
cannot  be  constructed  by  backwards  induction.  The  fourth  section  describes 
the  perfect  equilibria  in  the  nonrenewable  or  exhaustible  resource  case. 

The  last  section  contains  a  discussion  of  some  limitations  and  possible 
extens ions . 

II.  Equilibrium  and  Perfect  Equilibrium  in  Repeated  Games 

First,  let  us  consider  a  Prisoners'  Dilemma  game  repeated  twice.  Players 
are  informed  of  the  outcome  of  the  first  round  before  they  make  their  final 
choices,  and  each  maximizes  the  sum  of  his  payoffs.  The  one-shot  game 
is  : 


H  G 

+ . +-  ---  + 

H|(3, 3)1(0, 4)1 

+ . + . + 

G| (4, 0)1(1, 1)1 

+ . + . + 

Figure  1 


Applying  backwards  induction  to  this  game  (or  any  game  with  but  a  single 


one-shot  equilibrium)  we  should  expect  players  to  use  their  one-shot 
equilibrium  strategies  (here,  (G,G))  in  the  last  period,  regardless  of 
history.  The  payoff  function  for  the  first  period  is: 

+ . + . + 

1(4, 4)1(1, 5)1 
+ . + . + 

1(5, 1)1(2, 2)1 
+ . + . + 

Figure  2 

and  we  conclude  that  any  memoryless  sequence  of  equilibria  of  a  one-shot 
game  is  a  backwards  induction  equilibrium  of  the  finite  repetition  of  that 
game.  However,  there  are  other  equilibria.  Suppose  that  each  player 
decides  to  play  H  in  the  last  game  iff  (H,H)  was  played  in  the  first  game, 
and  to  play  G  otherwise.  The  first-stage  payoff  function  is: 

+ . + . + 

1(6. 6)1(1, 5)1 

+ - + - + 

1(5. 1)1(2, 2)1 
+ - + - + 

Figure  3 

This  game  has  3  equilibrium  points:  (H,H),  (G,G)  and  a  symmetric  mixed- 
strategy  equilibrium.  Of  course,  the  expectation  of  (H,H)  in  the  second 
period  would  probably  not  be  fulfilled  unless  the  players  could  credibly 
commit  themselves  to  it,  but  from  the  first-period  point  of  view  it  would 
still  be  an  equilibrium. 

If  the  one-shot  game  has  several  equilibrium  points,  then  backwards 
induction  does  not  eliminate  dependance  on  the  past.  Consider  the  two-fold 
repetition  of  Battle  of  the  Sexes: 


+ - + . + 

T|(3. 6)1(0, 0)1 

+ . + - + 

B| (0.0) I (6,3)1 

+ . + . + 

Figure  4 

Tliis  game  lias  three  equilibria  in  the  last  period:  if  p  (resp.  q)  is  the 
probability  of  player  1  (11)  using  strategy  T  (L),  then  the  one-shot  game 
has  equilibria  at  (p,q)  =  (0,0),  (1/3, 2/3),  and  (1,1),  with  payoffs  (3,6), 
(2,2),  and  (6,3).  Any  of  these  can  credibly  be  used  in  the  last  period 
without  violating  backwards  induction  rationality.  Depending  on  the  way  in 
which  these  are  "assigned"  to  outcomes  of  the  first  round,  a  variety  of 
equilibria  can  result.  If  the  players  decide  on  a  single  equilibrium  to 
use  in  the  last  round,  no  matter  what  the  previous  history,  then  the  first- 
round  game  will  differ  by  a  constant  from  the  one-shot  game  and  will  have 
the  same  equilibrium  strategics.  On  the  other  hand,  suppose  that  the 


players  agree  to  follow  (1,1)  with  (1,1);  (0,0)  with  (1/3, 2/3)  and  anything 
else  with  (0,0).  The  resulting  payoffs  are: 


+ . -+ . + 

I (6,12) I (6,3)  I 

+ . + . + 

|(b.3)  U8,5)| 

+ . + . + 

Figure  5 

Figure  6  shows  the  set  of  backwards  induction  equilibrium  average  payoffs 
for  the  2-stage  game. 


Figure  6 


0.^— memory  less  equilibria 


In  the  undiscounted  infinitely-repeated  game,  the  limiting  average  payoff 
may  not  e.xist.  However,  we  can  restate  the  Nash  equilibrium  requirement 
as:  a  player  will  not  defect  if  the  I im  inf  of  his  equilibrium  average 
payoffs  e.xceeds  the  1  im  sup  his  average  payoffs  obtained  by  a  unilateral 
change  of  strategy.  With  this  definition,  all  equilibrium  sequences 
converge,  and  we  can  use  the  "grim"  punishments:  any  defection  is  to  be 
punished  by  holding  the  defector  to  his  minmax  payoff  in  every  future 
round.  This  leads  to  the  Folk  Theorem  (cf  Aumann,  1976]:  every  payoff  that 
is  feasible  (in  the  convex  hull  of  the  one-shot  payoffs)  and  individually- 
rational  (gives  each  player  at  least  his  minmax  payoff)  is  the  payoff  to  an 
equilibrium  of  the  undiscounted  repeated  game,  and  vice  versa. 

If  future  payoffs  are  discounted,  the  immediate  payoff  to  a  defection  may 
become  more  important  than  the  eventual  punishment,  so  the  equilibrium  set 
cannot  be  described  in  terms  of  payoffs  alone. 

A  subgame  perfect  equilibrium  is  one  in  which  the  chosen  strategies  induce 
equilibrium  behavior  in  every  subgame.  In  repeated  games,  subgames  are 
defined  by  histories  of  play  in  previous  rounds.  In  the  undiscounted 
infinitely-repeated  game,  the  set  of  payoffs  to  subgame  perfect  equilibria 
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is  the  same  as  the  set  of  equilibrium  payoffs.  To  see  this,  it  is 
sufficient  to  define  a  sequence  of  "punishment  limits"  e(t)  that  tends  to 
0.  If  a  player  defects  from  the  equilibrium  strategy  behavior  at  round  t, 
he  will  bo  punished  by  repeatedly  being  held  to  his  minmax  payoff  until  his 
cumulative  average  payoff  exceeds  his  minmax  payoff  by  no  more  than  e(t). 

At  this  point,  play  returns  to  the  cooperative  sequence.  Whatever  residual 
profit  the  player  may  have  disappears  in  the  limit,  unless  he  defects 
again,  in  which  case  lie  faces  a  longer  punishment,  etc.  If  another  player 
tails  to  punish  in  the  appropriate  fashion,  then  that  player  becomes  the 
do  f  ector . 

This  result  also  fails  to  carry  over  to  the  discounted  game.  In  some 
eases,  e.g.,  when  there  is  an  equi’ibrium  of  the  one-shot  game  giving  the 
players  their  minmax  payoffs  (Prisoners'  Dilemma,  Battle  of  the  Sexes, 
etc.),  the  grim  punishment  of  a  defector  is  itself  an  equilibrium,  so  the 
s('ts  of  equilibrium  and  p<>rfect  equilibrium  payoffs  are  identical  [Cave, 
lh7')],  hut  there’  are  many  e.xamfiles  where  this  is  not  the  case. 

There  are  other  notions  of  perfection  that  are  stronger  than  subgame 
perfectness.  Selten  [lh751  has  defined  the  concept  of  trembling-hand 
perf(’ction,  which  retpiires  tlie  existence  of  a  sequence  of  completely-mixed 
strategies  converging  to  the  equilibrium  strategies,  with  the  property  that 
no  inferior  strategy  of  any  player  is  given  more  than  asymptotically  0 
weiglit.  In  two  player  games,  or  games  where  each  player's  payoff  depends 
only  on  the  aggregate  of  the  other  players'  strategies,  this  requirement  is 
equivalent  to  the  simpler  condition  that  no  player  uses  a  weakly  dominated 
strategy  wit!^  positive  probability.  That  this  is  stronger  than  subgame 
porf<'ction  can  be  seen  in  the  example  shown  in  Fig.  5,  where  only  the 
(upiilibrium  (0,0)  witli  payoff  (8,5)  is  trembling-hand  perfect,  although  all 
equilibria  are  subgame  perfect  by  construction. 

In  the  dynamic  s(!tting,  one  cannot  make  reference  to  the  one-shot  game  in 
the  same  way,  but  there  are  certain  similarities.  Strategies  can  still  be 
divided  into  specified  beliavior  (the  choices  that  follow  histories  that 
occur  with  positive  probability)  and  "punishments"  (the  rest  of  the 
choices).  We  can  sjjccify  a  sequence  of  moves  in  each  period  and  check  to 
see  whether  there  e.xists  a  punishment  that  will  make  players  adhere  to  this 
sequence.  For  subgame  perfect  equilibria,  we  require  that  the  punishment 
seiiuence  itself  be  an  equilibrium.  However,  instead  of  using  a  repeated 
one-shot  e()ui librium  for  punishing  defections,  we  shall  use  the  Nash- 
Cournot  equilibrium  obtained  by  either  backwards  induction  or  "heuristic" 
methods . 

II.  F.qui  librium  in  an  example  of  a  dynamic  game 

This  e.xample  is  drawn  from  Levhari  and  .Mirman  (op.cit.,  Section  2).  There 
is  an  initial  stock  of  fish  S(0).  If  the  stock  at  the  beginning  of  period 
l  is  S(t)  and  nation  i  (=  1,2)  extracts  the  fraction  h(i),  wheie  h(l)  + 
h('2)  is  <  1  ,  tlie  stock  at  the  beginning  of  period  t+1  will  be: 

(2  j  S( If  1  )  ^  [ (  1 -h( 1 )-h(2))S(t)  )“ 


for  some  a  £  [0,1],  Each  player  has  the  utility  function  U(i,c(i))  = 
ln(c(i))  and  discounts  the  future  using  the  discount  factor  6^ 

£  (0,1].  By  virtue  of  the  logarithmic  utilities  and  exponential  growth 
and  discounting,  we  shall  limit  our  attention  to  constant  cooperative 
sequences,  whore  each  country  in  each  period  consumes  a  fixed  proportion  of 
the  available  stock. 

If  a  nation  defects,  by  consuming  or  extracting  a  different  proportion  of 
the  available  stock,  this  will  become  apparant  during  the  next  period  (the 
growth  law  is  deterministic)  and  can  be  punished  then. 

If  the  constant  fractions  lr-''(l)  and  h*(2)  are  used,  the  time  paths  of  stock 
and  consumption  are  given  by: 


(3)  S(t)  =  [  l-h-'-(l  )-h--'  (2)|®^*'^S(0)“ 

and 

(4  )  c  (  i  ,  t )  =  h-''(  i  ).S  ( t  ) 

wlie  r(! 

a(  1  -a*" ) 

(  5  )  g  ( t  )  = _ 

1  -a 


The  present  discounted  value  of  using  the  strategies  h*  is 

ln(lr‘-(i))  ln(S(0))  a6.1n(l-h----(l)-h''-(2)) 

( t) )  V(i,h")  = _ +  _  +  _ _ _ 

1-6,  l-a6.  (1-6.)  (l-a6,) 

1  1  1  1 

which  we  can  simplify  by  multiplying  through  by  (1-5^)  (l-a6^) 

The  strongest  punishment  that  can  be  inflicted  is  immediate  consumption  of 
the  remaining  stock  of  fish.  The  payoff  to  both  players  if  this  is  done  is 
-  •»  (provided  6^  >  0);  hence  any  h-'-  is  an  equilibrium  h- . 

III.  Perfect  Equilibrium 

In  this  section,  we  shall  find  tlio  h-  which  can  be  sustained  when 
defections  are  met  with  an  immediate  and  permanent  return  to  the  Cournot- 
Nash  extraction  path;  this  will  give  us  a  subset  of  the  perfect  equilibrium 
payof  f  s . 

First,  we  can  use  the  payoff  function  (6)  above  to  determine  the  Cournot- 
Nash  equilibrium.  The  best  -  response  functions  are; 


(7)  h^’d)  =  (l-a6,)(l-h(j))  =  (l-e.)(l-h(j)) 


and  the  Nash  Equilibrium  extractions  are: 


(8)  h“(i)  = 


The  value  of  noncooperative  play  starting  with  a  stock  of  S  is  therefore: 

(9)  (l-6.)(l-e.)V"(i,S)  =  (l-e.)ln(l-e^)  +  B.lnC&^l  +  Ince^) 

-  +  (l-6.)ln(S) 

To  find  the  best  defection  from  h‘“  ( 1)  ,h'“(2)  possible  when  the  stock  is  S,  and 
defections  are  punished  by  a  return  to  noncooperative  play,  player  i  solves: 

(10)  max  In(hS)  +  6^ ( v"( i ,s“( l-h-h*( j ) )“) ) 


The  best  defection  is  the  same  as  the  best  defection  against  stationary  play 
of  h---(j): 

(11)  h^(h*(j))  =  (l-ep(l-h--.-(j)) 

The  payoff  that  i  can  anticipate  if  he  makes  this  best  defection  is: 

(  (l-B.)ln(l-e.)+&.ln(6p+6.(ln(&^)-ln(e^+&2-BjB2)) 

(12)  +(1 -6.  )(l-.(S)  +  ln(l-h*(j  )))!/(( 1-6.  )(1-Bpi 

We  can  now  write  the  perfect  equilibrium  condition.  Defining  K(i)  to  be  the 
expression 

(l-6.)ln(l-ep+B.ln(Bp+6.(ln(Bj)-ln(&j+&2-&jB2^^ 

we  have 

(13)  (l-B.)ln(h---(i))  +  B.ln(l-h*(l)-h*(2))  -  ( 1 -6  . )  In  ( 1 -h*(  j  )  )  >  K(i) 

As  a  check,  it  will  be  noted  that  the  Cournot-Nash  Equilibrium  shares 
h^  satisfy  these  constraints  with  equality. 

Now  fix  the  total  proportion  extracted  at  h*  and  use  h*(l)  +  h'"(2)  =  h*  to 
rewrite  (13)  as: 

(14)  f(h--'-(l)  ,h-”-)  >  0 


(15)  f(h-"--h-"-(l)  ,h----)  >  0 


Those  are  monotone  increasing  and  decreasing  functions  of  h*(l), 
respectively,  so  that  (14)  provides  the  lower  bound  on  perfect  equilibrium 


h'''(l),  while  (15)  provides  the  upper  bound.  Figure  7  shows  these  functions. 


Wo  shull  now  e.xamine  the  responsiveness  of  the  upper  and  lower  bounds,  which 
wo  dotiot('  y(h-')  and  .v;(h")  .  respectively,  to  changes  in  h* 

The  total  derivative  of  (14)  with  respect  to  x  and  h*  can  be  set  equal  to  0 
to  give  (for  i  =  1 )  : 

(l-&j)  (1-6^)  B.  (1-6^) 

(lb)  [ _ -  _ ]  dx  =  ( _  -  _ )  dh" 

X  l-lV"'+x  1-h"  l-h*+x 


The  coefficient  of  dx  is  positive,  while  that  of  dh‘"  varies  with  the  values 
f  h'“  and  x:  x(h")  increases  with  h“  iff: 

(l-6j-Bp(l-h-'-) 

(17)  X ( h )  >  _ _ =  X" 

Since  f  is  monotone  increasing,  x(h"')  is  increasing  in  h*  iff 

(18)  f  ( .X-'- , h- )  <  f  (xdr---)  ,h*)  =  0 


This  will  be  the  case  iff  h--'  is  at  least  h**,  where 


1-B, 

(19)  h--'--  =  cxp[  [_  1  In] 


1-^1  '-'l 

_ 1  +  [ _ )ln[l-6^]  +  ln[B^+B2-3^B2l 

1-6  -B  6 

1  ^1  1 


+  In(Bj)  -  ln(B2)) 


We  can  do  the  same  thing  with  y(h*):  total  differentiation  gives: 


Tho  coefficient  of  dy  is  negative,  so  that  y(h*)  increases  with  h*  iff: 


(21)  y  >  y--'--  =  1  -  (l-h----)/3. 


Again  exploiting  the  monotonicity  of  f,  this  means  that  y(h‘")  will  increase 
with  h"  iff 


(22)  0  =  f(h----y(h*)  ,lr---)  <  f(h--'-->-v^h*) 

This  will  be  the  case  iff  Ir-'-  <  h°,  where 


(23) 


li  =  1 


Tho  set  of  equilibrium  shares  is  shown  in  Fig.  8. 


A  [);i  r  l  i  cu  1 .1  r  ttJtal  li'\cl  of  I'.xt  r.u.l  j(5ii  <..ni  be  sustaiiu'd  by  a  p(?rfect 
e()ii  i  1  i  br  i  urn  of  this  sirn;)!c  form  iff  there  e.xists  x  such  that 

124)  f  I  z  ,h  I  =  f  1  h  ,  h  i  >  0 

Monotonicity  of  1  implies  that  tliere  is  a  uiiicpu'  v.  satisfying  the  equation 
ill  (24),  which  Wi’  liiuiote  by  /.  I  h  )  .  Substituting  zlh),  w('  get 

(  2  i  )  C;  (  li  1  =  f  (  1  h  )  ,  h  )  =  1  1  h  -  z  (  h  )  ,  h  )  2  0 

I'rom  this  it  is  ev  ident  that  (1(0  l  =  G(  1  )  =  -«■.  Moreover,  the  ('.xistcnce 
oi  Toil  rnot -Nasli  r-qii  i  i  i  br  i  urn  shares  shows  that  (  2  ”> )  is  not  vaccuous.  G  can 
also  be  shown  to  he  (.on<-ave  in  li .  from  which  we  conclude  that  the  set  of 


1 


-  10  - 


susLainablo  total  extractions  is  convex.  While  this  is  complicated  in  the 
general  case,  G  Ix^comes  much  more  tractable  if  we  assume  that  both  agents 
have  the  samt;  discount  factor,  6.  W<!  can  also  define  &  to  be  the 
common  value  B  =  In  tliis  case,  z(li)  is  h/2,  and  G  can  be  written; 

(2b)  G(h)  =  (l-B)ln(h)  +  Bln(l-h)  -  (l-6)ln(2-h)  -  (l-B)ln(l-B) 

-  Bln(B)  +  61n(2-B)  -  (6-B)ln(2) 


In  this  case,  the  concavity  of  G  is  easy  to  verify: 

1-B  B  1-6 

(2^)  G"(h)  =  -[  +  _ +  _ I  <  0 


(I-h)^  (2-hr 


Figure  9 

We  can  also  do  some  limited  comparative  statics,  to  see  the  effect  of  changes 
in  the  parameters  on  the  set  of  sustainable  total  extraction  rates. 

3G  [ ln(2-h)+ln(l-B)-ln(h) 1  B 

(28)  _  = _ -  _ 

36  "  6  2-B 

which  is  positive  iff; 

(29)  h  <  2/(R+l) 
where 

(29' )  ln(R)  =  (B6)/(2-Bj  -  In(l-B) 

On  the  other  hand,  the  sign  of  the  coefficient  of  dh  is  the  same  as  the 
sign  of 

(30)  (1-B)  -  (2-B+6)h  +  6h^ 


1 


,1 


J 


1 

i 


t 


« 


which  follows  the  pattern  as  h  increases.  If  the  coefficients  are  of 


the  same  sign  at  the  maximal  extraction  rate  and  of  opposite  signs  at  the 
minimal  extraction  rate,  then  the  set  of  sustainable  extractions  increases 
with  increases  in  6.  This  is  what  we  would  expect  from  the  analogy  with 
repeated  games,  but  the  dynamic  game  is  complicated  by  the  renewal  effect. 
At  the  maximal  rate  of  extraction,  each  player  is  just  kept  from  increasing 
his  catch  by  the  future  reward.  If  the  players  become  more  far-sighted 
(6  increases),  the  value  of  this  reward  increases.  On  the  other  hand, 
if  6  falls  to  0,  only  h=l  is  sustainable,  and  it  is  sustainable  with  any 
shares!  This  does  not  look  like  the  repeated-game  situation  where 
increasing  myopia  shrinks  the  set  of  equilibria  to  the  one-shot  equilibria. 
In  this  example,  we  would  have  to  solve  for  the  maximal  and  minimal 
sustainable  extraction  rates  explicitly  as  functions  of  a  and  6  from; 

(31)  h^‘^(l-h)^(2-h)®‘^  =  (l-p)^‘^8^2®'^(2-e)‘^ 

We  shall  defer  further  comparative  statics  analysis  to  the  exhaustible 
resource  discussion  in  Section  IV. 

We  now  turn  to  the  question  of  the  sustainability  of  Pareto  Optimal  levels 
of  extraction. 

In  the  symmetric  case,  the  Pareto  Optimal  levels  are  found  by  solving: 

(32)  max  wV( 1 ,h( 1) ,h(2) )  +  ( 1 -w)V(2 ,h( 1 ) ,h(2) )  subject  to 

h(i)  5  0,  each  i,  and 
h(l)  +  h(2)  <  1 

for  fixed  w  e  (0,1).  The  logarithmic  form  of  the  V(i)  ensures  that  the 
constraints  will  not  be  binding  at  the  optimum.  The  first-order  conditions 
are : 

1-8  8 

(33)  w[ _ ]  =  [ _ I 

h(l)  l-li(l)-h(2) 


1-8  8 

(3A)  (l-w)[ _ ]  =  [ _ 1 

h(2)  l-h(l)-h(2) 

These  can  be  solved  for  interior  Pareto  Optima: 
h(l)  =  w(l-8) 

(35) 

h(2)  =  (l-wl(l-8) 

and  the  Pareto  Optimal  shares  are  those  summing  to  (1-8). 

I'rom  the  previous  analysis,  we  conclude  that  there  exists  a  Pareto  Optimal 
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perfect  equilibrium  of  this  form  iff 
1-6 

(36)  lii(2-&)  - _ ln(l+3)  >  (l-a)ln2 

6 

In  the  non-symmetr ic  case,  the  Pareto-optimal  total  extractions  are 
sensitive  to  the  welfare  weights:  in  other  words,  it  is  no  longer  true  that 
all  Pareto  Optimal  extraction  levels  are  the  same.  Solving  the  general 
welfare  problem,  we  can  see  that  (h(l),h(2))  solves  (32)  iff 

(37)  h(l)  =  w(l-e^)  h(2)  =  (l-w)(l-&2) 

IV.  The  Exhaustible  Resource  Case 

An  interesting  special  case  is  that  of  a  non-renewable  resource,  obtained 
by  setting  a  =  1.  We  further  limit  attention  to  the  symmetric  case. 

Conditions  (13)  for  a  perfect  equilibrium  of  the  form  we  have  been  studying 
simplify  to 


h(l)  6 

(38)  (l-6)ln( _ 1  +  61n(l-h)  >  (l-6)ln(l-6)  +  61n( _ ] 

l-h+h(l)  2-6 

h-h(l)  6 

(38)  (l-6)ln( _ 1  +  61n(l-h)  >  (l-6)ln(l-6)  +  61n[ _ ) 

1- h(l)  2-6 

If  we  insert  the  intersection  value  h(l)  =  h/2,  we  obtain  the  condition  for 
h  to  be  sustainable: 

h  6 

(40)  (l-6)ln[ _ 1  +  61n(l-h)  >  (l-6)ln(l-6)  +  61n[ _ ) 

2- h  2-6 

Now  consider  the  case  6  =  1/2.  Equation  40  is  quadratic,  and  gives: 

(41)  1/2  <  h  <  2/3 

Wo  can  solve  for  the  equilibrium  shares  h(l)  and  h(2)  using  (38)  and  (39). 
This  gives  two  equations  relating  the  shares: 

(42)  (l-6h(l))h(2)  >  1  -  6h(l)(l-h(l)) 

(42)  (l-6h(2))h(l)  >  1  -  6h(2)(l-h(2)) 

These  shares  are  shown  in  Figure  10  below. 


A  slightly  more  complicated  case  occurs  at  6  =  2/3:  in  this  case,  (40) 
is  equivalent  to  the  cubic  equation: 


(43) 


12h  -  24h  +  13h  -2  >  0 


and  the  sustainable  total  extractions  satisfy; 

(44)  0.2713  <  h  <  0.5000 

The  equations  relating  the  shares  of  the  two  players  are: 

(45)  12h(i)  [  l-h(l)-h(2)  +  h(j)  -  1  >  0  for  i  j  E  {1,2} 

What  is  interesting  about  these  two  examples  is  the  fact  that  increasing 
myopia  increases  the  sustainable  levels  of  extraction.  In  one  sense,  this 
is  what  we  would  expect  by  analogy  with  the  repeated  game  results:  as  the 
players  get  less  myopic,  the  results  are  "nicer";  at  any  6  =  2/3  perfect 
equilibrium  of  this  form,  the  time  path  of  stocks  dominates  that  at  any 
6  =  1/2  equilibrium.  In  another  sense,  the  result  is  surprising,  since 
increases  in  the  discount  factor  for  repeated  games  are  associated  with 
increases  in  the  set  of  equilibrium  outcomes,  whereas  this  example  seems  to 
suggest  a  shift  in  the  set. 

The  impression  that  something  different  is  going  on  is  reinforced  if  we 
compute  the  comparative  statics  results  near  the  two  discount  factors: 


6  =  2/3 


6  =  1/2 


min imum 

maximum 

minimum 

.2713 

.  5 

.5 

.5067 

- . 4444 

.3333 

.6299 

- .  5 

.4776 

-1.2431 

-1.1251 

-1.4328 

maximum 

.6667 

-.3750 

-.3333 

-0.8889 


At  both  points,  small  increases  in  the  discount  factor  are  associated  with 
decreases  in  both  the  maximum  and  minimum  sustainable  rates  of  extraction, 
with  the  latter  falling  faster  than  the  former. 

Of  course,  to  say  that  the  stock  is  being  exhausted  at  a  slower  rate  does 
not  really  mean  that  the  players  are  "more  cooperative."  In  order  to  make 
sucii  a  statement,  we  would  have  to  assert  something  about  the  optimality  of 
the  sustainable  extraction  rates.  Even  a  monopolist  will  depelete  stocks 
at  a  slower  rate,  the  higher  is  his  discount  factor. 

There  is  a  Pareto  Optimal  perfect  equilibrium  of  the  form  we  have  been 
studying  if 


61n(2-6)  >  (l-6)ln(l+6) 


which  is  to  say  6  ^  1/2.  The  shares  at  such  an  equilibrium  satisfy 


(A7) 


In(hri))  -  ln(6+h(l))  >  ln(l-6)  -  ( 6/ ( 1 -6 ) ] In ( 2-6 )  e  ln(m) 
ln(l-6-h(l))  -  ln(l-h(l))  >  ln(m) 


(A8) 


So  the  equilibrium  shares  satisfy 

1-6-m  m6 

(49)  _  >  h(l)  >  _ 

1-m  1-m 


This  interval  is  nonempty  as  long  as  6  >  1/2,  and  may  well  be 
nontrivial.  For  e.xample,  if  6  =  .75,  we  get  h(l)  t  [  .  1 10 1 ,  .  1399  ]  . 
Therefore,  even  in  the  simple  case  of  a  nonrenewable  resource  extracted  by 
identical  duopolists  at  the  Pareto  Optimal  rate,  there  is  still  room  for  a 
range  of  perfect  equilibrium  outcomes. 

We  shall  close  this  discussion  of  the  exhaustible  case  by  describing  the 
outcomes  in  utility  terms,  using  (6).  Neglecting  the  utility  of  the 
initial  stock,  the  utility  of  player  i  is  proportional  to: 

( 1-6)  ln(h(i))  +  61n(l-h(l)-h(2)) 

At  the  maximal  and  minimal  rates  of  e.xtraction,  the  players  each  consume 
the  same  proportion  of  the  existing  stock.  If  total  extraction  is  h, 
utility  is  therefore  proportional  to: 

(l-6)ln(h)  +  61n(l-h) 


However,  it  is  not  necessarily  the  case  that  extremes  of  total  extraction 
are  associated  with  extreme  points  of  utility.  In  fact,  the  only  case 
where  this  does  happen  is  at  the  point  6  =  1/2,  where  there  is  a  unique 
Pareto  Optimal  perfect  equilibrium  corresponding  to  the  slowest  perfect 
equilibrium  rate  of  extraction.  In  all  other  cases,  the  utilities 
associated  with  perfect  equilibria  are  a  "folded”  image  of  the 
corresponding  set  of  perfect  equilibrium  extraction  rates. 


To  show  this 
equ i 1 ibr ium 
2/3. 


Figures  10  and 
shares  and  the 


11  present  the  set  of  sustainable  perfect 
utilities  associated  with  them  for  6  =  1/2 


and 


Figure  10:  6  =  1/2 


Pareto-optimal 


1  ( 

+ . - . h(l) 

sustainable  shares 


Figure  11:  6  =  3/A 

The  folding  occurs  at  greater  discount  factors  because  higher  total  rates 
of  extraction  are  associated  with  more  rapid  exhaustion  of  the  resource, 
and  thus  with  lower  total  utility.  However,  they  are  still  perfect  equil¬ 
ibrium  outcomes. 

To  determine  the  size  of  the  fold,  or  the  range  of  Pareto  Optimal  utilities 
associated  with  perfect  equilibria,  we  fix  the  discount  factor,  and  thus 
total  extraction,  and  vary  the  individual  shares  using  the  perfect 
equilibrium  conditions.  The  following  table  gives  the  maximal  and  minimal 
Pareto  Optimal  utilities  for  a  player  at  various  discount  rates.  The  set 
of  utilities  is  symmetric,  so  we  give  only  one  set. 


5 

h* 

min  h( 1 ) 

max  h(l) 

min  V( 1 ) 

max  V(l) 

.5 

.  5 

.25 

.25 

-2.0794 

-2.0794 

.  55 

.45 

.22 

.23 

-2.2005 

-2.2446 

.6 

.4 

.  191 

.209 

-2.3317 

-2.4217 

.65 

.35 

.  163 

.  187 

-2.4765 

-2.6143 

2/3 

1/3 

.  1538 

.  1795 

-2.5826 

-2.6827 

.  7 

.3 

.  1360 

.  1640 

-2.6399 

-2.8276 

.75 

.25 

.  1101 

.  1399 

-2.8298 

-3.0695 

.8 

.  2 

.0854 

.  1146 

-3.0589 

-3.3530 

.85 

.  15 

.0620 

.0880 

-3.3509 

-3.7022 

.9 

.  1 

.0399 

.0601 

-3.7593 

-4.1706 

.95 

.05 

.0192 

.0308 

-4.4541 

-4.9286 

.99 

.01 

.0037 

.0063 

-6.0939 

-6.5915 

Fn 1 i kr 

the 

repea ted -game 

situation,  the 

normalized 

utilities  are 

directly  comparable,  as  can  be  seen  in  the  fact  that  the  more  far-sighted 
agents  are  worse  off  at  Pareto  Optimal  equilibria.  The  rates  of  extraction 
are  probably  a  better  standard  of  comparison. 

V.  Discussion 

This  paper  has  shown  that  the  use  of  threat  strategies  in  a  dynamic  game 


can  widen  the  possibilities  for  equilibrium  behavior,  even  when  the  threats 
are  required  to  be  credible.  In  essence,  the  players  in  such  a  game  have  a 
great  deal  of  leeway  for  threatening  each  other:  to  each  possible 
"cooperative"  choice  of  e.xtraction  rates  there  corresponds  a  unique 
sequence  of  stocks,  given  by  (3).  In  seeing  whether  this  behavior  can  be 
made  to  result  from  an  equilibrium,  players  are  allowed  to  choose  their 
e.xtraction  levels  freely  at  any  other  stocks.  If  this  is  regarded  as  a 
garni'  in  extensive  form,  the  players  are  allowed  to  take  the  period  at  which 
they  observe  a  particular  stock  into  account,  and  react  accordingly.  That 
is  to  say,  the  strategies  we  are  considering  here  are  explicitly  "closed- 
loop"  strategies.  This  is  in  contrast  to  the  stationary  strategies 
considered  by  Mirman  [1979],  which  have  the  property  that  current 
consumption  depends  only  on  the  current  stock. 

It  is  possible  to  construct  threat  strategies  that  depend  only  on  current 
stocks,  but  the  throats  would  be  empty,  since  players  would  be  unable  to 
tell  whether  a  given  stock  level  resulted  from  defections  or  from 
variations  in  the  initial  stock.  Threats  could  be  made  somewhat  more 
powerful  if  allowed  to  depend  on  both  initial  and  current  stocks,  but  they 
would  still  fail  to  deter  defections  of  a  certain  kind:  suppose  that  along 
the  cooperative  path  stocks  were  to  increase  monotonical ly  to  the  steady 
state.  A  player  could  unilaterally  increase  his  consumption  by  an  amount 
sufficient  to  ensure  that  the  stock  in  the  next  period  would  be  one  of 
those  that  had  already  occurred.  The  other  player,  if  using  an  open-loop 
strategy,  would  be  unable  to  detect  such  a  defection,  and  would  proceed  as 
if  notliing  had  happened. 

Such  partial  approaches  seem  unsatisfactory,  so  we  have  chosen  to  use  time- 
arid  state-dependant  strategies.  However,  this  game  is  played  under 
conditions  of  certainty:  the  growth  law  is  deterministic,  so  that  a  player 
knowing  the  sequence  of  his  own  past  consumptions  and  of  the  stocks  will 
also  know  the  past  consumption  by  his  opponent,  introducing  noise  into  the 
system  would  complicate  matters.  In  the  first  place,  even  if  the  players 
could  observe  each  other's  catch  the  optimal  policies  would  not  be 
stationary.  In  addition,  a  fluctuation  in  the  stock  observed  by  a  player 
could  result  from  either  natural  or  unnatural  causes,  and  the  posterior 
probability  that  a  particular  stock  reflects  deviation  from  cooperative 
behavior  is  not  well-defined  if  the  players  are  following  pure  strategies. 
The  alternative  is  to  use  a  concept  similar  to  trembling-hand  perfectness 
or  sequentiality  [Kreps  and  Wilson,  1982). 

In  the  deterministic  case,  we  have  characterized  a  set  of  subgame-perfect 
equilibria.  The  results  would  have  been  the  same  had  we  used  the 
trembling-hand  perfectness  concept,  since  each  player's  payoff  depends  only 
on  the  aggregate  of  the  other  players'  extractions.  Since  the  Cournot-Nash 
policies  are  undominated  in  this  game,  the  strategies  used  in  Section  III 
are  also  trembling-hand  perfect. 

Finally,  there  are  other  strategies  that  can  be  used  to  construct  subgame- 
perfect  equilibria.  For  instance,  any  perfect  equilibrium  path  can  be  used 
as  a  threat  if  a  particular  path  is  defected  from.  In  other  words,  we 
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could  choose  that  path  among  those  in  Section  III  with  the  lowest  payoff 
for  player  1,  and  threaten  to  play  according  to  this  rule  if  player  1 
defects,  and  similarly  for  player  2.  In  the  repeated-game  situation  this 
does  not  add  to  the  set  of  outcomes,  but  it  may  do  so  in  the  dynamic  game. 
In  fact,  to  any  pair  of  extraction  rates  h  =  (h(l),h(2))  we  may  associate 
"sufficient  punishments",  which  arc  other  extraction  rates  offering  one  or 
the  other  player  a  lower  payoff.  Lot  the  set  of  sufficient  punishments  for 
player  i  be  denoted  P(i,h),  and  the  intersection  P(h).  Any  h  for  which 
P(h)  is  nonempty  is  an  equilibrium  h.  Finding  a  sequence  h*  = 
h ( 1 ),..., h ( t ),..  .  such  that  h(t)  belongs  to  P(h(t-]))  for  all  t,  is 
necessary  and  sufficient  for  h-'  to  be  the  outcome  of  a  perfect  equilibrium. 
In  this  paper,  we  have  limited  ourselves  to  those  h*  for  which  the  Cournot- 
Nash  policies  belong  to  P(h""),  since  the  latter  are  fixed  points  of  the 
cor respondance  P. 


References 

1 .  Aumann ,  R . J . : 

Le^uje  Notes  in  Game  Theory , 

Stanford  University,  [1976] 

2.  Cave,  J.A.K.:  "Equilibrium  Behavior  in  Infinitely-Repeated  Games",  Ph.D. 
dissertation,  Stanford  University,  [1979] 

3.  Kreps ,  D.  and  R.  Wilson;  "Sequential  Equilibria", 

Econom<^t_r  i ca  ,  v  .  5_0  , 

no.  •+ ,  [1982],  pp .  863-894 

4.  Lewis,  T.  and  J.  Cowens :  "The  Great  Fish  War:  A  Cooperative  Solution", 
Californi.i  Institute  of  Technology  Social  Science  Working 

Pape r  448  ,  [1982  1 

3.  I.evliari,  I),  and  L.  Mirman:  "The  Grtnat  Fish  War:  An  E.xamplo  Using  a  Dynamic 
Cournot -Nash  Solution" 

Bell  -lotn'iial  of  Economics, 

[  198(1  j  pp.  322-344  “ 

b.  'lirman,  1,.;  "Dynamic  Models  of  Fishing:  A  Heuristic  Approach"  in  P.T.  Liu 
and  J.G.  Sutinen,  eds . 

Control  Theory  in  Mathematical  Economics, 

New  York;  Decker,  [1979J,  pp.  39-73 

He  i  nganur:, ,  .1 ,  P .  ,  and  N.  L.  Stokey;  "Oligopolistic  Extraction  of  a  Non- 
Renewable  Common - Pro()e rty  Resource;  The  Importance  of  Commitment 
in  Dynamic  Games",  Northwestern  University  Discussion  Paper  508, 

[1981  I 

8.  Selten,  R.  "Re-examination  of  the  Perfectiu^ss  Concept  for  Ecpiilibrium  Points 
in  Extensive  Games", 

International  .Jcnirnal  of  (lame  Tln^ry  Vol  .  4, 

[  197  5  j  ,  pp.  2.5-55 


END 

FILMED 


5-85 


DTIC 


